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(57) Abstract 

A method and apparatus for CELP-based to CELP-based vocoder packet translation. The apparatus includes a form ant parameter 
translator and an excitation parameter translator. The formant parameter translator includes a model order converter and a time base 
converter. The method includes the steps of translating the formant filter coefficients of the input packet from the input CELP format to 
the output CELP format and translating the pitch and codebook parameters of the input speech packet from the input CELP format to the 
output CELP format. The step of translating the formant filter coefficients includes the steps of converting the model order of the formant 
filter coefficients from the model order of the input CELP format to the model order of the output CELP format and converting the time 
base of the resulting coefficients from the input CELP format time base to the output CELP format time base. 
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1 

CLEP TRANSCODING 

BACKGROUND OF THE INVENTION 
Field of the Invention 

The present invention relates to code-excited linear prediction (CELP) 
speech processing. Specifically, the present invention relates to translating 
digital speech packets from one CELP format to another CELP format 

Related Art 

Transmission of voice by digital techniques has become widespread, 
particularly in long distance and digital radio telephone applications. This, in 
turn, has created interest in determining the least amount of information which 
can be sent over the channel while maintaining the perceived quality of the 
reconstructed speech. If speech is transmitted by simply sampling and 
digitizing, a data rate on the order of 64 kilobits per second (kbps) is required to 
achieve a speech quality of a conventional analog telephone. However, through 
the use of speech analysis, followed by the appropriate coding, transmission, 
and resynthesis at the receiver, a significant reduction in the data rate can be 
achieved. 

Devices which employ techniques to compress voiced speech by 
extracting parameters that relate to a model of human speech generation are 
typically called vocoders. Such devices are composed of an encoder, which 
analyzes the incoming speech to extract the relevant parameters, and a decoder, 
which resynthesizes the speech using the parameters which it receives over a 
channel, such as a transmission channel. The speech is divided into blocks of 
time, or analysis subframes, during which the parameters are calculated. The 
parameters are then updated for each new subframe. 

Linear-prediction-based time domain coders are by far the most popular 
type of speech coder in use today. These techniques extract the correlation from 
the input speech samples over a number of past samples and encode only the 
uncorrelated part of the signal. The basic linear predictive filter used in this 
technique predicts the current sample as a linear combination of the past 
samples. An example of a coding algorithm of this particular class is described 
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«mam et d. Proceedings of the Mobfle Satellite Conference, 1988 

THe function of the vocoder is to compress the digitized speech shmal 

m speech. Speech typically has short term redundancies due primarily to the 

£S£T£ ° f *: ^ "* ,0nSUe ' - JM * duet 
J Z . h 3 ^ C ° der ' *- °P eraaons « -ode.ed 

IsT^m7and ^ md * lon ^™ P«* «*«• °"ce 

*twrr u t "* T 0 ^' residual signal can be modeled 

as white gaussian noise, which is also encoded. 

filter, ""Z^Z ° f *!. tKhniqUe * to """P"* * e Parameters of tivo digital 
filters. One filter, called the format filter (also known as the "LPC (linear 

Z>tT ^ Mttr ' ^ * e P"* mter - P«*>rms long term 

prediction of fte speed, waveform. Finally, mese filters must be exdtea and 
4- - done by determining which one o, a number of random exotnon 
waveforms m a codebook results in to doses, approximation to the origina! 
J-A when the waveform exdtes the two niters mentioned above. ThusTe 

Digital speech coding can be broken in two parts; encoding and 
decoding, sometimes known as analysis and synthesis. FIG. 1 is a block 
diagram of a system 100 for digitally encoding, transmitting and decoding 

3T? L l w T MUdeS ' 10Z - 3 1M - - d a decoder 106 

Channel 104 can be a communications channel, storage medium, or the like. 
Coder 102 receives digitized input speech, exteacte the parameters describing 
the features of the speech, and quantizes these parameters into a source bit 
steeam that is sen, to channel 104. Decoder 106 receives me bi, stream from 
channel 104 and reconsiructe the output speech waveform using the quantized 
features in the received bitstieam. 

Many different formats of CELP coding are in use today. In order to 
successfully decode a CELP-coded speech signal, the decoder 106 must employ 
tile same CELP coding model (also referred to as "format") as the encoder 102 
that produced die signal. When communications systems employing different 
CELP formate must share speech date, it is often desirable to convert the speech 
signal from one CELP coding format to another. 

One conventional approach to mis conversion is known as "tandem 
codmg." FIG. 2 is a block diagram of a tandem coding system 200 for 
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encoders fi£u. C^ d ! ' ^ ^ " ^ ^ f °™ al 
*> heteinafter as' the Tpu^T^ h T~ " ^ (t *™ d 

format (ref^tohe^ Jt .^ ^T*f °~ ^ 
input signal to orodu~ „ v P format) ' Decoder 206 dec ° d <* the 

fo-t (referred to ^^TLX^t^ 1 ^ ^ 
signal in the outout n. V format) to produce an output 

perLtuaTde^Hnr diS * d ™>°& « «• approach is L 

SUMMARY OF THE INVENTION 

CELP b™ J*"™! taVena ° n * 3 meth0d md a PP aratu = for CELP-based to 
CELP-based vocoder packet translation. The apparatus includes a Z.! 
parameter transfctor mat teansla.es input formatter JE£* ^ 

•* u " 311 exatafa on parameter translator that translates 

ord« converter mat converts the model order of the input formal fflte 

of *e outputCELP format and a time base converter that converts the time bale 
of the mput formam filter coefficients from me time base of me inpToS 
formattothetimebaseoftheoutputCELPformat. * 

_ The method includes the steps of translating the formant filter 

foetid te , inpul r** from * e taput ^ *° ~p* 

format and translatmg me pitch and codebook parameter of the input speech 
input CELP forma, to .he output CEIP format. 1^ 
M^T me fonnan, filter coefficients includes the steps of translating the 
foment fflto coefficients from input CELP format to a reflection coefficient 
CELP forma,, converting me model order of the reflection coefficients from the 
model order of the input CELP format to the model order of the output CELP 
format, translating the resulting coefficients to a line spectral pair (LSP) CELP 
fonnat converting the time base of me resulting coefficients from the input 
CELP format time base to the output CELP format time base, and translate the 
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^Tr°fTT from ^ P l °™ * the output CELP forma, to produce 
output fo^t Mter coefficients. n, e step of translating the P L and 
codebook parameters inCudes the steps of synthesizing speecn using the input 
of T X °° k PMameterS to Pr ° dUCe * -arching foX 

sicssr — — *° -* - - «2 

d^,H^ adVM>tage ° f * e Present taven,ion * *** « eliminates the 
degradation a percept speech ouaU^ norma,,, induced by tandem cod fog 

BRIEF DESCRIPTION OF THE FIGURES 

beco™^ fealUreS ' 0b,eC,S ' •*»*•■» of the present invention will 
become more apparent from the detailed description set forth below when 
taken m conjunct™ with the drawings in which like reference characters 
identify correspondingly throughout and wherein: 

and de^p^f ^ * " ^ *" — 

an innu^T P ° f 3 for converting from 

an input CELP format to an output CELP format; 

FIG. 3 is a block diagram of a CELP decoder; 

FIG. 4 is a block diagram of a CELP coder; 

FIG. 5 is a flowchart depicting a method for CELP-based to CELP-based 
vocoder packet translation according to an embodiment of the present 
invention; 

FIG. 6 depicts a CELP-based to CELP-based vocoder packet translator 
according to an embodiment of the present invention; 

FIGS. 7, 8, and 9 are flowcharts depicting the operation of a formant 
parameter translator according to an embodiment of the present invention; 

FIG. 10 is a flowchart depicting the operation of an excitation parameter 
translator according to an embodiment of the present invention; 

FIG. 11 is a flowchart depicting the operation of a searcher; and 

FIG. 12 depicts an excitation parameter translator in greater detail. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The preferred embodiment of the invention is discussed in detail below. 
While specific steps, configurations and arrangements are discussed, it should 
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*ZfXlT^ k J'» a0 ~'"" , ~""™'^™"* A person skilled 

^^Ssitr^r-r 

^"sp^T e ^ . A a »°" - » CDMA wireless 

spread spectrum communication systems for telephone service 

■„,. „ P ^ en ' taVenH ° n " d6SCribed ta P^- Ftet a CELP codec 
mclud-ng a CELP coder and a CELP decoder, is described. TheTa patS 
translator is described according to a preferred embodiment 

exemnW cpfr"? 1 8 \ prefcned -*«»—* - implementation of the 
C^ .^f " ******** ^implementation, 

s^r a , "* ^ "**^** ™*hod to encode a speech 

^ S 10 mefl, ° d - ^ ° f «» *«* P-™— ™ com^S 
m»open-loop manner, while others are determined in a closed-loop mode by 
tnal and error. Specifically, the LPC coefficients are determined by saving a Z 

eaT^T, ? * e remamin8 P^ameters (codebook index, codebook 
gam, prtch lag, and pitch gain) are used with the forman, filter to synthesize a 
»P"ch signal The synthesized speech signal is then compared to the actual 
speech s,gnal to determine which of the hypothetical values of the remaining 
parameters synthesizes the most accurate speech signal. 

A Code Excited Linear Predictive (CELP) Decoder 

The speech decoding procedure involves unpacking the data packets, 
unquanozmg the received parameters, and reconstructing the speech signal 
from these parameters. The reconstruction consists of filtering the generated 
codebook vector using the speech parameters. 

FIG. 3 is a block diagram of a CELP decoder 106. CELP decoder 106 
mcludes a codebook 302, a codebook gain element 304, a pitch filter 306, a 
formant filter 308, and a postfflter 310. The general purpose of each block is 
summarized below. 

Formant filter 308, also referred to as an LPC synthesis filter, can be 
thought of as modeling the tongue, teeth and Hps of the vocal tract, and has 
resonant frequencies near the resonant frequencies of the original speech caused 
by the vocal tract filtering. Formant filter 308 is a digital filter of the form 
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6 

l/A(z)=l-a 1 2'-...-a n z^ (1) 



zzttzzzsr mtei 308 are refmed to - — - 

Pitch filter 306 car, be thought of as modeling the periodic Dulse tmin 
comtng from the vocal cords during voiced speech. Voiced s^ isCdu^ 

in ^v"nT ^ a " ,pleS ° f V ° i<:ed 80,11,118 are * e O in low" and the A 
m day Durmg unvoiced speech, the pitch filter basically passes the input to 

^2,™^ U ~ *«* - P^uced by foLg a*ti2gh ! 

TmT^^ VOCaI «"* Exam " leS °< voiced sounds „e 

o tne m shuffle, formed by a constriction between the lower Ho and 
upper teeth. Pitch filter 306 is a digital filter of the form P 

l/P(z) = l/(l.bz«-) = l + b 2 -' + b'z~ + ... 

where b is referred ,o the pitch gain of the filter and L is the pitch teg 0 , the 

unvo- C f tho "8 ht ° f - modeling the turbulent noise in 

unvotced speech and the excitation to the vocal cords in voiced speech. During 
background noise and silence, me codebook output is replaced by rand J 

vZrs C ^\ 302 ^ * nUmbeI ° f ^ WMds ^ * as codebook 
V JTT. C rf°° k Vectore « e according to a codebook index I. The 

selected codebook vector is scaled by gain element 304 according to a codebook 
gam parameter G. Codebook 302 may include gain element 304. The output of 
the codebook is then also referred to as a codebook vector. Gain element 304 
can be implemented, for example, as a multiplier. 

Postfilter 310 is used to "shape" the quantization noise added by the 
parameter quantization and imperfections in the codebook This noise can be 
noticeable in frequency bands which have little signal energy, yet might be 
■mperceptible in frequency bands which have large signal energy. To take 
advantage of this property, postfilter 310 attempts to put more quantization 
no.se mto perceptually insignificant frequency ranges, and less noise Into 
perceptually significant frequency ranges. This postfiltering is discussed 
farther in J-H. Chen & A. Gersho, "Real-Time Vector APC Speech Coding at 
4800 bps with Adaptive Postfiltering," in Proc. ICASSP (1987) and N.S. Jayant & 
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SSStSST Postmtoin8 of Speech/ ' in Proc - ICASSP — 

^Jl 0 """^ "* ^ ° f dl » Btad *«* «*— °~ or more 

306 and forma., ffl ter 308. Pitch filter 306 operates on the selected ctdebook 
vector accordtng to pitch gain b and pitch lag L Fonnant filter 308 o^ln 

a Z r * ^ ^ 306 to *»— *« 

... a, to produce synthesized speech signal .(n). 

A Code Excited Linear Predictive (CELP) Coder 

The CELP speech encoding procedure involves determining the input 
parameters for the decoder which minimize the perceptual different be^Ta 
^ ^ *■ aiginzed spL, signal, 

P T netaS ^ deSCribed ta *« f ° U °^ subsections. 
Tfeencodmg procedure also includes quantizing the parameters and packing 

JTitT 6 for ,raramiS5ion ' as would be apparcnl ,o one ^ ied * 

. ^ f It™ 3 bl °? d3 ** am °' 3 ^ COder 10Z ^ cod <" 102 ^eludes 
acodebook 302 a codebook gain element 304. a pitch filter 306, a formant filter 
3* a perceptua, weighting filter 410, an LPC generator 412, a summer 414, and 
a mmmuzauon element 416. CELP coder 102 receives a digital speech signal 
s(n) tha, » partitioned into a number of frames and subframes For each 
subframe, CELP coder 102 generates a set of parameters mat describe the 
speech stgnal in mat subframe. These parameters axe quantized and 
* CELP decoder 106. CELP decoder 106 uses these parameters ,0 
synthesize the speech signal, as described above. 

Referring to FIG. 4, the generation of LPC coefficients is performed in an 
open-loop mode. From each subframe of input speech samples s(n) LPC 
generator 412 computes LPC coefficients by methods well-known in the 
relevant art. These LPC coefficients are fed to fonnant filter 308. 

The computation of the pitch parameters b and L and codebook 
parameters I and G however, is performed in a closed-loop mode, often referred 
to as an analysis-by-synthesis method. According to this method, various 
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aStdL? t ^ ° f COdeb °° k "* PltCh P aramete - « Wlied to 
a CELP coder to synthesize a speech signal .(n). Ik. synthesized speech signal 

!T."!" ^ r(n > ** re «"* fr»»> this comparison is provided to 
~zanon element 416. Minimization element 416 select different 

wehm^H ^ emb ° dimem de P icted m HG ' 4 < *«e input speech samples s(n) are 
wetted ^ perceptuaJ ^ sQ tha( P 1 

samples are provided to sum input of adder 414. Perception weighST" 
uhhzed to weigh, the error a, the frequencies where there is Jess sign^ ^ 
It » at these low signal power frequencies that «he noise is more p^ceZanv 
nottceab le. This perceptua. weighting is further discussed in Urn ef ^ 
Paten, 5,414,796 enmled "Variable Rato Vocoder," which is Z^^Ty 
reference herein in its entirety. F 7 

Minimization element 416 conducts the search for the codebook and 
prtch parameters in two stages. First, rmnimization element 416 searches for the 
pitch i parameters. During the pitch search there is no contribution from the 
codebook (G = 0). In rnirumization element 416 all possible values for the pitch 
big parameter L and the pitch gain parameter b are input to pitch filter 306. 
Mmmuzabon element 416 chooses the values of L and b that nunimize the error 
r(n) between the weighted input speech and the synthesized speech 

Once the pitch lag L and the pitch gain b for the pitch filter are found, the 
codebook search is performed in a similar manner. Minimization element 416 
then generates values for codebook index I and codebook gain G. The output 
values from codebook 302, selected according to the codebook index I, are 
multiplied in gain element 304 by the codebook gain G to produce the sequence 
of values used in pitch filter 306. Ivlinimization element 416 chooses the 
codebook index I and the codebook gain G that minimize the error r(n). 

In one embodiment, perceptual weighting is applied to both the input 
speech by perceptual weighting filter 410 and the synthesized speech by a 
weighting function incorporated within formant filter 308. In an alternative 
embodiment, perceptual weighting filter 410 may be placed after adder 414 
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CELP-based to CELP-based Vocoder Packet Translation 

to as J^^ fo "^ diSCUSslon '* es I»«h P^et to be translated is referred 
to as the mpu," packet having an "input" CELP forma, ma, * 
~debook and pitch parameters and "input" formant filter coeffiZa 
Ukewrse, the result of the transition is referred to as the "output" paTk* 

™r r" CEU> fom,at *" codeboot and p tch 

parameters and "output" formant filter coefficients. One useful application o] 

t™ 6 ; "If? ° f *" taPl " ^ « «»"*« from the 

input CELP forma, to the ou,pu, CELP forma,, as shown in step 502. In me 

Z£ £f I""* COdeb °° k Parametere of *« *P U ' Packet are 

stotT t° * „T folma ' *° 1116 ^ ^ fo ™"< - *»» « 

ouL^, p S,88e ' ^ ° UtpUf P Mamel ^ «• quantized w»h me 

output CELP quantizer. 

FIG. 6 depicts a packet translator 600 according to a preferred 
embodament. Packet translator 600 includes a formant parameter translator 620 
and an excitation parameter translator 630. Formant parameter translator 620 
translates the input formant filter coefficients to the output CELP format to 
produce output formant filter coefficients. Formant parameter translator 620 
includes a model order converter 602, a time base converter 604, and formant 
filter coefficient translators 610A,B,C. Excitation parameter translator 630 
translates the input pitch and codebook parameters to the output CELP format 
to produce output pitch and codebook parameters. Excitation parameter 
translator 630 includes a speech synthesizer 606 and a searcher 608. FIGS 7 8 
and 9 are flowcharts depicting the operation of formant parameter translator 
620 according to a preferred embodiment. 

Input speech packets are received by translator 610A. Translator 610A 
translates the formant filter coefficients of each input speech packet from the 
input CELP format to a CELP format suitable for model order conversion. The 
model order of a CELP format describes the number of formant filter 
coefficients employed by the format. In a preferred embodiment, the input 
formant filter coefficients are translated to reflection coefficient format as 
shown in step 702. The model order of the reflection coefficient format is 
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tnmsla^lll 0 "'? 602 rereiTCS »■«*» confidents from 

CHIP fann* as 1™ toste^ m h ? *T ^ ° f *« ° U * ut 

interpolator 612 P 0rder converter «H includes an 

interpolator biz and a declinator 614 Whon th» j ,„ . 

format is tower a,™ m . , . ^ model orde r <* *e input CELP 

612 uerftJTT ™T , m0deI order °f the output CELP format, interpolator 

Z^T^Tn^T^ t0 ^ ^cilte, as 

^^2^^^^^'^ are sel to 2610 

of the m,h>„* to., me mput CELP format is higher than the model order 
of fte output CELP forma,, dedmator 614 performs a decimation operation to 
reduce the number of coeffidents, as shown in step 804. In one JbodinLT 
Ate unnecessary coeffidents are simp. y replaced byzeroes. Such ^ 
and decimation operations are well-known in the relevant arts Z 
ooefodent reflection domain mode., order conversion TTeLtiv^y 1£ 

CELP formats are the same, model order conversion is unnecessary 

from IT, !, 6108 reC6iVeS * 6 °rder.«>rrected formant filter coeffidents 

rXt^n lffl f r VCrter ^ ^ *- * e coeflicients from £ 
reflection coeffiaen. forma, to a CELP forma, suitable for time base conversion 
TOe tune base of a CELP format describes the rate a, which the formal, 
ynmeas parameters are sampled, i.e., me number of vectors per second" 
formant synthesis parameters. In a preferred embodiment, me "flection 

TO. Methods for performing such a translation are well-known in the relevant 

ftiim T T* '** C ° nVerter 604 r6CeiVeS 1,16 ^ coefficient from translator 
610B and converts the time base of the LSP coeffidents from the time base of the 

ZT™ r T' to ^ baS<i ° f ° UtpUt CTLP fo ->at. as shown in 
step m base converter 604 indudes an interpolator 622 and a dedmator 
62* When me tone base of me inpu, CELP for™, to lower ^ me ^ baM 
of me output CELP forma, <i.e, uses fewer samples per second), interpolator 
622 performs an interpolation operation to increase tire number of samples, as 
shown m step 902. When the time base of foe inpu, CELP forma, is higher man 
the model order of the outpu, CELP format (U., uses more samples per 
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mp , , relev ant arts. Of course, if the time base of the inDut 

w co^i; e toe b ~ of fte -* — * rr 

coeffldlr^mtoT ^ ^ dormant filter 

LSP fon^oT C ° nVerler 6M Md *• "-ts from the 

^ s rrrL^ fo rco to pr r: ou,put f ™ — 

employs LSP fa™JT . . ,f * e out P ut CELP fo ™« 

3,^ , f0 T ,t * tor .~ ffld -*< *is translation is unnecessary. 

«T»r 6 ° U,PUt f0Iman, filter from translator 

610C and quannzes the output forman, filter coefficients, as shown in step 712 

(afeo refe^T ^ " ^ ™ d COdeb °° k 

(also referred to as "excitation" parameters) of the input speech packet are 

«n fl ° WChart de P ictta S * e «P—*« of excitation parameter 

translator «0 according to a preferred embodiment of the present invLon. 

Refemng to FIG. 6, speech synthesizer 606 receives the pitch and 
codebook parameters of each input speech packet. Speech synthLer 606 

f™;,^ I' 5 " 41 - ™ to M * e W *e output 

translator 620, and the input codebook and pitch excitation parameters as 

*Z o r 7 T ^ m step 1004 ' 608 * " 

codebook and Pitch parameters using a search routine similar to that used by 
pa^aLterr" 64 3bOVe - Sea " her 608 *" qUanti2es *• ° ut P u « 



FIG. 11 is a flowchart depicting the operaHon of searcher 608 according 
ulZ "™i emh ° di ™* °< * e P~ * "venHon. In this search, searcher 608 
uses the output fbrmant niter coefficients generated by formant parameter 
trandator 620 and the target signal generated by speech synthesizer 606 and 
cand.date codebook and pitch parameters to generate a candidate signal as 
shown » step 1104. Searcher 608 compares the target signal and the candidate 
signal to generate an error signal, as shown in step 1106. Searcher 608 then 
vanes the candidate codebook and pitch parameters to minimize the error 
signal, as shown in step 1108. The combination of pitch and codebook 
parameters that minimizes the error signal is selected as the output excitation 
parameters. These processes are described in greater detail below. 
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d-J^ I PiCtS CXCitati0n Parameter 630 in greater detail. As 

t^T ^ excitation parameter translator 630 includes a speech 
svn*esxzer 606 and a searcher 608. Referring to FIG. 12, speech synthesizer 606 
mcludes a codebook 302A, a gain element 304A, a pitch filter 306A, and a 
formant niter 308A. Speech synthesizer 606 produces a speech signal based on 
excitation parameters and formant filter coefficients, as described above for 
decoder 106. Spedfically, speech synthesizer 606 generates a target signal 
usmg the input excitation parameters and the output formant filter coefficients. 
Input codebook index I, is applied to codebook 302A to generate a codebook 
vector. The codebook vector is scaled by gain element 304A using input 
codebook gain parameter G,. Pitch filter 306A generates a pitch signal using the 
scaled codebook vector and input pitch gain and pitch lag parameters b, and L_ 
Formant filter 308A generates target signal using the pitch signal and the 
output formant filter coefficients a., ... aon generated by formant parameter 
translator 620. Those of skill would appreciate that the time base of the input 
and output excitation parameters can be different, but the excitation signal 
produced is of the same time base (8000 excitation samples per second, in 
accordance with one embodiment). Thus, time base interpolation of excitation 
parameters is inherent in the process. 

Searcher 608 includes a second speech synthesizer, a summer 1202, and a 
minimization element 1216. The second speech synthesizer includes a 
codebook 302B, a gain element 304B, a pitch filter 306B, and a formant filter 
308B. The second speech synthesizer produces a speech signal based on 
exatation parameters and formant filter coefficients, as described above for 
decoder 106. 

Specifically, speech synthesizer 606 generates a candidate signal s (n) 
usmg candidate excitation parameters and the output formant filter coefficients 
generated by formant parameter translator 620. Guess codebook index ^ is 
applied to codebook 302B to generate a codebook vector. The codebook vector 
is scaled by gain element 304B using input codebook gain parameter G c . Pitch 
filter 306B generates a pitch signal using the scaled codebook vector and input 
pitch gain and pitch lag parameters b G and 1^. Formant filter 308B generates 
guess signal s G (n) using the pitch signal and the output formant filter 
coefficients a^ ... a^. 

Searcher 608 compares the candidate and target signals to generate an 
error signal r(n). In a preferred embodiment, target signal s^n) is applied to a 
sum input of a summer 1202, and guess signal s c (n) is applied to a difference 
mput of summer 1202. The output of summer 1202 is the error signal r(n) 
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Em,r signal r(n) is provided »o a minimization element 1216 . 
Mmrmtzauon element 1216 selects different combinations of codebook and 
ptteh parameters and determines the combination that rnmimi.es errTsi^l 

lir^^T", 10 * 3 ' dCSCribed ab ° Ve ^ to — 
rroTm L k Uft ^ COdeb °° k " nd P i,ch ««s that result 

from ft. search are quantized and used with the forman, filter coefficients that 

translator 600 to produce a packet of speech in the output CELP format. 
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Conclusion 14 



en,M. ^ descri P« on ° ( *e Preferred embodiments is provided to 

enaMe^ any person skilled in the art to make or use me present invention Tne 

s^TiTmt ^ * ^ 6mb ° dimentS 1,6 ^ W to those 
rZ u*T. ' * C 8enerk pril,d P ,es defined here " ™y be applied to 
ofter embodiments without me use of the inventive faculty. Tnus thfpresem 

tor"" Z T ^ l ° " 6 ltai,ed '° * e -bodimen^hoJhe^in" 
to be accorded the widest scope consistent with the principles and novel 
features disclosed herein. 
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What Is Claimed Is: 
1. 



is 

CLAIMS 



one code exci^lL P r at ^ f0r 3 COm P ressed packet from 

code excited Imear prediction (CELP) format to another, comprising: 

to an output CELP fTrTT, ^ corresponding to a speech packet 

a?elc^f Pr ° dUCe ° UtpUt f0rmant mter efficients; and 

v Parameter translator that translates input pitch and 

Z^JZT^" " *** CELP folmat - d -resp'ondmgto s*d 
Z^ZllT ° UtPUt CEU> format t0 Pr ° dUCe «*" -d 

tnmsJorcom^r^ * ^ " - <™ P~ 
a model order converter that converts the model order of said inr>», 

mooTo d ^ ^ 4 m ° dd ^ ° f Said -pmaJfoltTa 

model order of said output CELP format; and 

filter co^ ^1 C ° nVerter COnVertS ** base of Said -Put Want 

S ^~rr e w ° f ^ faput celp *— - * — °< 

^r C o^ m ° f daim 2 ' ^ Mid P-meter 

and J SyntKeSiZer ** Pr ° dUCeS 3 ^ si S- al said input pitch 

and codebook parameters and said output f ormant filter coefficients; and 

a searcher that searches for said output codebook and pitch parameters 
using said target signal and said output formant filter coefficients. 

4. The apparatus of claim 3, wherein said searcher comprises- 

exrit « SPCeCh Synthesizer ** * guess signal using guess 

excitation parameters and said output formant filter coefficients- 

a combiner that generates an error signal based on said guess signal and 
said target signal; and 

a miiumization element that varies said guess excitation parameters to 
minimize said error signal. 
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furthe/co^ 3 ™ ° f * ta 3 ' — convener 

ffl*, 3 ^ C ° effidenl ' IanSlat0r *»' *>id input formant 

tohel^: 3 ™ ° f daim 5 ' WhGrein ^ m ° del — ~ 

_ J" * te *P° lat <* *at interpolates said third coefficients to produce order 
corrected coefficients when said model order of said input c£p formal 
lower man said model order of said output CELP format;^md 

a decimator that decimates said third coefficients to produce said order 
corrected coefficients when said model order of said input CELPW* Z 
higherthansaidmodelorderofsaidoutputCELPformat. 

comprises: ^ * ^ * " U ^izer 

a codebook using said input codebook parameters to produce a 
codebook vector; F a 

vprtor ' Pitd \ mt6r ^ Said ^ P itch ^ter parameters and said codebook 
vector to produce a pitch signal; and 

a fa J™ mt using said output formant filter coefficients and said 
pitch signal to produce said target signal. 

r*nJl / PpaiatUS ° f daim 7 ' whe «** »* guess excitation 

parameters mclude guess pitch filter parameters and guess codebook 
parameters, wherein said further speech synthesizer comprises: 

a further codebook using said guess codebook parameters to produce a 
further codebook vector; 

a pitch filter using said guess pitch filter parameters and said further 
codebook vector to produce a further pitch signal; and 

a formant filter using said output formant filter coefficients and said 
further pitch signal to produce said guess signal. 

9. The apparatus of claim 2, further comprising: 

a first formant filter coefficient translator that translates said input 
formant filter coefficients to a fourth CELP format before use by said time base 
converter. 
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10. 



The apparatus of claim 2, further comprising: 

said <L72 d f0nnant ^ C ° effident *at translates the output of 

"IT ^ ^ ^ to said output CELP 

linesp^alp^^ ^ * **** ^ *"* ^ * * 

CELPfoLtf me t.° d C ° nVertinS a com P^sed speech packet from one 
CELP format to another, comprising the steps of: 

(a) plating input formant filter coefficients corresponding to a speech 
packet from an input CELP format to an output CELP format to produce output 
formant filter coefficients; and F output 

Cb) translating input pitch and codebook parameters corresponding to said 
speech packet fi om said input CELP format to said output^ format to 
produce output pitch and codebook parameters. 

14. The method of claim 13, wherein step (a) comprises the steps of • 
(0 converting the model order of said input formant filter coefficients from 

for^t and ^ **** ^ t0 3 ^ ° f Said ° Ut P Ut CELP 

(ii) converting the time base of said input formant filter coefficients from a 
time base of said input CELP format to a time base of said output CELP format. 

15. The method of claim 14, wherein step (b) comprises the steps of 

synthesizing speech using said input pitch and codebook parameters in 

said mput CELP format and said output formant filter coefficients to produce a 
target signal; and 

searching for said output pitch and codebook parameters using said 
target signal and said output formant filter coefficients. 

16. The method of claim 14, wherein step (i) comprises the steps of 
translating said input formant filter coefficients from said input CELP 

format to a third CELP format to produce third coefficients; and * * 
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of s JZTS pT 0 " 61 ° rder ° f ^ *" COeffidentS fr ° m a mo ^ order 
said input CELP format to a model order of said output CELP format to 
produce order corrected coefficients. * 0 

rrLl ^ me ?° d ° f d3im 16 ' WhCrein StC P (U) COm P rises steps of: 
fourth SSS ^ " COrreCt6d COeffidentS t0 3 foUrth *™« * P-duce 

said in^Sf ^ baSC ° f ^ fOUrth C ° effiCientS form 3 time base ^ 
said mput CELP format to a time base of said output CELP format to produce 
time base corrected coefficients; and proauce 

tosaidtZ™^ W W C ° efficiente fro - fourth format 

said output CELP format to produce said output formant filter coefficients. 

thestjof: ^ meth ° d ° f ^ 15 ' S3id *P comprises 

generating a guess signal using guess codebook and pitch parameters 
and said output coefficients; parameters 

signalXd 6 ^ ^ ^ ^ ^ Signal *"* said ta ^et 

error si^T 5 ^ **** C ° deb °° k and P itch Parameters to rninimize said 

steps of 9 ' ™ e meth ° d ° f Claim 16 ' wherein ste P « ^ther comprises the 

interpolating said third coefficients to produce said order corrected 
coeffiaents when said model order of said input CELP format is lower than 
said model order of said output CELP format; and 

decimating said third coefficients to produce said order corrected 
coefficients when said model order of said input CELP format is higher than 
said model order of said output CELP format. 

20. The method of claim 16, wherein said third CELP format is a 
reflection coefficient CELP format. 

21. The method of claim 17, wherein said fourth CELP format is a line 
spectral pair CELP format. 
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ISJt!?^7 E FORMANT F"-TER COEFFICIENTS FROM 
INPUT CELP FORMAT TO REFLECTION COEFFICIENT 
CELP FORMAT 



CONVERT THE MODEL ORDER OF THE REFLECTION 
COEFFICIENTS FROM THE MODEL ORDER OF THE 
INPUT CELP FORMAT TO THE MODEL ORDER OF THE 
OUTPUT CELP FORMAT 



TRANSLATE COEFFICIENTS TO A LINE SPECTRAL 
PAIR (LSP) CELP FORMAT 



CONVERT THE TIME BASE OF THE LSP COEFFICIENTS 
FROM THE INPUT CELP FORMAT TIME BASE TO THE 
OUTPUT CELP FORMAT TIME BASE 



TRANSLATE THE COEFFICIENTS FROM LSP FORMAT 
TO THE OUTPUT CELP FORMAT TO PRODUCE 
OUTPUT FORMANT FILTER COEFFICIENTS 
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